Search CORE

5 research outputs found

Word Sense Disambiguation using a Bidirectional LSTM

Author: Kågebäck Mikael
Salomonsson Hans
Publication venue
Publication date: 01/01/2016
Field of study

In this paper we present a clean, yet effective, model for word sense disambiguation. Our approach leverage a bidirectional long short-term memory network which is shared between all words. This enables the model to share statistical strength and to scale well with vocabulary size. The model is trained end-to-end, directly from the raw text to sense labels, and makes effective use of word order. We evaluate our approach on two standard datasets, using identical hyperparameter settings, which are in turn tuned on a third set of held out data. We employ no external resources (e.g. knowledge graphs, part-of-speech tagging, etc), language specific features, or hand crafted rules, but still achieve statistically equivalent results to the best state-of-the-art systems, that employ no such limitations

arXiv.org e-Print Archive

Chalmers Research

Chalmers Publication Library

Word Sense Embedded in Geometric Spaces - From Induction to Applications using Machine Learning

Author: Kågebäck Mikael
Publication venue: Chalmers University of Technology, Gothenburg
Publication date: 01/01/2016
Field of study

Words are not detached individuals but part of a beautiful interconnected web of related concepts, and to capture the full complexity of this web they need to be represented in a way that encapsulates all the semantic and syntactic facets of the language. Further, to enable computational processing they need to be expressed in a consistent manner so that similar properties are encoded in a similar way. In this thesis dense real valued vector representations, i.e. word embeddings, are extended and studied for their applicability to natural language processing (NLP). Word embeddings of two distinct flavors are presented as part of this thesis, sense aware word representations where different word senses are represented as distinct objects, and grounded word representations that are learned using multi-agent deep reinforcement learning to explicitly express properties of the physical world while the agents learn to play Guess Who?. The empirical usefulness of word embeddings are evaluated by employing them in a series of NLP related applications, i.e. word sense induction, word sense disambiguation, and automatic document summarisation. The results show great potential for word embeddings by outperforming previous state-of-the-art methods in two out of three applications, and achieving a statistically equivalent result in the third application but using a much simpler model than previous work

Chalmers Research

Chalmers Publication Library

Extractive Summarization using Continuous Vector Space Models

Author: Dubhashi Devdatt
Kågebäck Mikael
Mogren Olof
Tahmasebi Nina
Publication venue
Publication date: 01/01/2014
Field of study

Automatic summarization can help users extract the most important pieces of information from the vast amount of text digitized into electronic form everyday. Central to automatic summarization is the notion of similarity between sentences in text. In this paper we propose the use of continuous vector representations for semantically aware representations of sentences as a basis for measuring similarity. We evaluate different compositions for sentence representation on a standard dataset using the ROUGE evaluation measures. Our experiments show that the evaluated methods improve the performance of a state-of-the-art summarization framework and strongly indicate the benefits of continuous word vector representations for automatic summarization

Crossref

Chalmers Research

Chalmers Publication Library

A Discriminative Convolutional Neural Network with Context-aware Attention

Author: Chen Qian
Cheng Jianpeng
Cuturi Marco
Devlin Jacob
Hermann Karl Moritz
Joulin Armand
Juan
Kalchbrenner Nal
Kim Yoon
Kingma Diederik P.
Kusner Matt
Kågebäck Mikael
Lei Tao
Li Jian
Lin Chin-Yew
Lin Zhouhan
Liu Baoyuan
Mihalcea Rada
Narayan Shashi
Rehurek Radim
RoyChowdhury Aruni
Tai Kai Sheng
Vaswani Ashish
Wang Yaming
Williams Travis
Xue Wei
Zhang Tianyang
Zhang Xiang
Zhang Ye
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref